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CN '. 1 Introduction 

^ ' Recently, various quantum information-processing technologies have been proposed. Quantum sta- 
, tistical inference is included in these technologies. It is a feature of quantum information processing 
CN ■ that any information is described by a small number of quanta (i.e., particles that behave in quantum 
mechanics). In traditional information processing that can be considered classical, any media com- 
municating information can be considered to behave in classical mechanics: In such circumstances, 
from the viewpoint of quantum mechanics, any information can be described by quanta in order of 
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^0 , the Avogadro number, so we can extract necessary information almost without any state reduction. 

■ However, we need to consider the state reduction by measurement in a microscopic world where we 
O ' 

cannot apply the classical approximation, that is, any information of interest is not described by a 
CLi] sufficiently large number of quanta. 

In such circumstances, it is required to extract information from obtained data efficiently. Gener- 
ally, investigations on optimizations of data processing (e.g., estimating process) are called statistical 
CJ^' inference. In particular, quantum systems have a problem due to measurement, which demolishes the 
state describing the information. Therefore, we can measure the system only once. (Any outcome 
of any repeated measurement can be described by an outcome of single measurement.) For suitable 
estimation, we need to optimize our measurement. Furthermore, noncommutative physical quantities 
(observables) cannot be measured simultaneously. This fact makes the problem more difficult. In 
future, we can expect that this framework is applicable to nanotechnology and information processing 

*The original Japanese version of this manuscript was published as an introductory article of quantum estimation in 
Bulletin of Mathematical Society of Japan, Sugaku, Vol. 55, No. 4, 368-391 (2003); Received September 2, 2002, Published 
October 24, 2003. It was translated to English by Michikazu Sato and Masahito Hayashi. The essential content of this 
manuscript is the same as that of the original version, while several descriptions and references are improved. The 
material in this paper was presented in English in part at Joint MaPhySto and QUANTOP Workshop on Quantum 
Measurements and Quantum Stochastics, Department of Mathematical Sciences, University of Arhus, Denmark, 2003, 
Non-locality of Quantum Mechanics and Statistical Inference, Kyoto Sangyo Univ., Kyoto, 2003, and Special Week on 
Quantum Statistics, Isaac Newton Institute for Mathematical Sciences, Cambridge, UK, 2004. The author was with 
the Laboratory for Mathematical Neuroscience, Brain Science Institute, RIKEN, Wako, Saitama, 351-0198, Japan. He 
is now with ERATO-SORST Quantum Computation and Information Project, Japan Science and Technology Agency 
(JST), Bunkyo-ku, Tokyo, 113-0033, Japan, and Superrobust Computation Project, Information Science and Technology 
Strategic Core (21st Century COE by MEXT) Graduate School of Information Science and Technology, The University 
of Tokyo, Bunkyo-ku, Tokyo, Japan (e-mail: masahito@qci.jst.go.jp). 



1 



systems that are excessively integrated. This formulation is called quantum estimation, which started 
in 1967 in relation to studies on optical communications from the quantum viewpoint. 

On the other hand, uncertainty in quantum mechanics has attracted and fascinated many people. 
It is natural awareness of the issues to formulate the uncertainty, which is peculiar in quantum systems, 
and to aim to rigorous discussion. In such circumstances, quantum estimation can be regarded as a 
theme that formulates the uncertainty in quantum mechanics with mathematical rigor. Furthermore, 
as we will mention in §2, owing to mathematical difficulty for quantum systems (or noncommuta- 
tivity), quantum estimation requires higher mathematical theories that are not used in traditional 
statistics. On one hand, quantum estimation is a mathematical theory based on experimental facility 
and statistical estimation, a kind of engineering framework; on the other hand, it is attractive as a 
basic problem of quantum mechanics and a new mathematical theory. Such multiplicity in quantum 
estimation is an obstacle for going on the study, while it is also rare attraction in this field. 

Now, we will briefly pursue its development historically. In 1967, Helstrom [41] started a study of 
quantum estimation introducing the symmetric logarithmic derivative (SLD). In 1970s, excel studies 
are made: Yuen and Lax [94] studied estimation for a complex amplitude of coherent light in thermal 
noise and Holevo [47] studied estimation for the family of the quantum Gaussian states. Furthermore, 
Holevo introduced the right logarithmic derivative [47, 50], formalized group-covariant parametric 
estimation [49, 50], and so on. In the beginning of 1980s, the studies in this field stagnated, but 
later in 1980s, the studies from a viewpoint different from those in 1970s began and developed rapidly 
[35]. For example, we can list estimation in a two-parameter model of quantum two-level systems by 
Nagaoka [66], estimation in a three-parameter model of quantum two-level systems by Hayashi [20, 21], 
and an approach to estimation theory in quantum two-level systems by evaluating the total Fisher 
information by Gill and Massar [15]. On the other hand, Fujiwara and Nagaoka [4, 5, 8], Hayashi [21, 
22], and Matsunioto [59, 61] analyzed in detail the cases where it is known that the unknown state is a 
pure state. In addition, Hayashi [23] applied irreducible representation theory of general linear groups 
[90] to statistical inference for quantum systems. Through detailed discussions, Matsumoto [63] derived 
the bound of estimation error for arbitrary finite-dimensional models. Furthermore, Nagaoka [64, 67] 
and Hayashi [28] discussed the state estimation from the viewpoint of large deviation, where two 
quantum versions of the Fisher information arc compared and considered from the unified viewpoint. 
On the other hand, in the estimation of eigenvalues of the unknown density matrix, we have only 
one quantum version of Fisher information. Matsumoto [63] and Keyl and Werner [54] discussed 
this problem using the irreducible representation theory of general linear groups, which we mentioned 
before. The latter treated only large deviation, while the former treated also the mean-square error. 
In quantum estimation theory, it is a theme to improve the accuracy of estimation by using quantum 
correlation in the measurement apparatus. As the first study in this direction, Hayashi [21] treated 
simultaneous estimation of the complex amplitude parameter and the photon number parameter in 
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a quantum Gaussian state. Besides, Hayashi [29, 32] showed that, use of correlation in measurement 
apparatus improves the accuracy of estimation of eigenvalues in the quantum two-lcvcl system under 
the assumption that any measurement is non-adaptive. Here, in an implementable framework, he 
discussed improvement using correlation in measurement apparatus over the case without correlation 
in measurement apparatus. In fact, the experiment proposed by the papers [29, 32] has been already 
implemented by Hendrich et al. [43]. However, they concerned only the estimation by this method 
and did not concern the precision (MSE) of the estimation. Furthermore, Fujiwara [9, 10] applied the 
theory of estimation to a problem of estimating the quantum communication channel. 

On the other hand, Kwiat et al. [11, 12, 13, 14, 91] first made statistical estimation for the quantum 
state generated by an actual optical system. Usami et al. [87] demonstrated more accurate estimation 
based on fewer data generated by numerical experiment. In future, more accurate estimation would 
be implemented^. 

In this article, according to the organization mentioned below, we will treat the former studies 

from the unified viewpoint using the quantum central limit theory and so on. In §2, we will outline 

the relation between quantum estimation and its related fields including the historical process. In 

§3, we will summarize the formulation in quantum mechanics needed in minimum to treat quantum 

estimation. In §4 and §5, we will explain the quantum Gaussian states, which can be regarded as 

the quantum version of Gaussian distributions, first for the one-mode case, second for the multi-mode 

case. Then, in §6, we will describe the quantum central limit theorem and explain similarity and 

difference with the central limit theorem in classical systems. The part until §6 is the preliminary 

stage for the statistical discussion, and after this part we will have a particular discussion for statistical 

inference. In §7, considering readers who are not familiar with classical (i.e., with respect to a family of 

probabilities) statistical inference, we will explain the formulation and the idea of statistical inference. 

There, we will explain that classical estimation can be regarded as a particular case of quantum systems 

mathematically: quantum estimation can be regarded as a generalization of classical estimation theory. 

Then, in §8, we will summarize known results in classical systems. The following sections will treat 

their extension to the quantum case. First, in §9, we will discuss estimation only for the expectation 

parameter of the family of the multi-mode quantum Gaussian states. Holevo [47] mainly formulated 

this estimation, which can be treated relatively easily. In §10, we will treat estimation theory for 

a general family of quantum states. Our methods of estimation will be divided into two classes by 

presence or absence of using quantum correlation in the measurement apparatus, and we will show that 

both bounds of estimation error do not coincide. We also will derive the general forms with respect to 

the bounds of estimation for both. In particular, in §11, we will focus on the simultaneous estimation 

of the expectation parameter and the photon number parameter in the family of one-mode quantum 

'^FoUowing these studies, Hayashi et al. established an optimization theory of measurement for testing entanglement 
[40] and applied it to entangled photon pairs generated by SPDC [39]. Also the paper [40] describes the experimental 
framework in more detail. 
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Gaussian states. The proposed estimator will illustrate what quantum correlation in measurement 
apparatus is suitable for effective estimation. In §12, wc will describe the role of the quantum central 
limit theorem to derive the bound for the errors of estimators when quantum correlation is used in 
measurement apparatus. Finally, in §13, we will describe themes in future based on these results. 

In this article, we will use somewhat complicated mathematical description. Operators on a Hilbert 
space corresponding to the quantum systems are denoted in capital letters, while constants are denoted 
in small letters. Furthermore, besides the Hilbert space, we often consider finite-dimensional vectors 
or matrices, which are denoted in small boldface, while their components are denoted in small lightface 
with a suffix (or suffixes, which we will not notice later). In addition, we sometimes consider vectors or 
matrices whose components are operators. They arc denoted in capital boldface and their components, 
which are operators, arc denoted in capital lightface with a suffix. When a vector or a matrix itself 
has a suffix, to denote its component, we write the suffix expressing its component first and after a 
semicolon (;) we write the suffix that the vector or the matrix itself has. For an Z-dimensional vector or 
matrix a, the vector or the matrix of the components from the first to the dth {d < I) is denoted as a^- 
The similar notation is used for vectors or matrices whose components are operators. Furthermore, 
for operators X and F, we write [X, Y] := XY -YX andXoY := {XY + YX)/2. 

2 Relation to other fields 

Quantum estimation can be regarded as a kind of mathematical statistics of quantum systems. Main 
results in this field were not obtained with a mere combination of quantum mechanics and mathe- 
matical statistics, but they were obtained first with further detailed discussions. Here, referring its 
historical progress and its relation to quantum mechanics, we will explain its connection with related 
fields including mathematical statistics and quantum information theory. 

Quantum theory, which describes a microscopic world, began in 1900 when Plank's constant was 
discovered. Then, it was formulated through the proposals of matrix mechanics by Heisenberg, wave 
mechanics by Schrodinger, and probabilistic interpretation by Born. Furthermore, in 1930s, von 
Neumann succeeded in mathematically rigorous formulation with probabilistic interpretation and de- 
scription of state reduction in measuring process [89]. Probabilistic interpretation is essential for 
statistical inference in quantum systems because the estimation error and the error probability are 
concerned for this topic. However, von Neumann's formulation causes various problems because he 
adopted different principles in time evolution in a measuring process and usual time evolution. The 
conceptual problems of quantum mechanics, such as the measurement problem, seem to come from 
these double standards. 

In quantum systems, probabilistic interpretation is crucial when we measure one quantum (or a 
small number of quanta). In many actual physical experiments, however, observers measure quanta 
(e.g., atoms or molecules) with around Avogadro's constant (ca. 6x 10^^) simultaneously, so they obtain 
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only the average of the measured value directly. Owing to this reason, many studies based on quantum 
mechanics concern only the expectations of observables and rarely concern the probability distribution 
of measured data. On the other hand, while there are many topics whose name contains "quantum" 
in mathematics, they focus only on higher mathematical difficulty such as infinity in quantum field 
theory, rather than problems related to probabilistic interpretation in quantum mechanics. Hence, 
statistical inference in quantum system is different from many traditional studies in that it targets 
not only the expectations of the measured data but their probability distributions themselves. In this 
mathematical stream with quantum mechanics, theory of Hilbert spaces was formulated and theory of 
operator algebra followed. Then, entropy theory [76] was expanded in relation to statistical mechanics, 
and furthermore it developed to quantum probability theory. Then, the quantum central limit theorem 
[16], which is a theme in this article, was obtained by following these achievements, and it developed 
into free probability theory. 

Here, we will briefly mention the relation between mathematical statistics and probability theory. 
Both fields treat the idea of probability, but it seems that they concern different aspects of probability. 
The former treats optimizations in some sense, e.g., the optimization of a random variable (an esti- 
mator etc.) for purposes of estimation and so on, while the latter takes an interest in mathematical 
structures. However, the central limit theorem has a basic position in both fields. In the former, it 
is widely accepted as a fundamental fact that the maximum likelihood estimator is asymptotically 
optimal and is asymptotically distributed as the normal distribution whose variance is the inverse of 
the Fisher information (or the Fisher information matrix) [57, 88]. This is essentially because the cen- 
tral limit theorem guarantees that the logarithmic differentiation of the probability density function 
is asymptotically distributed in the normal distribution as the number of data increases. Thus, the 
basic result of probability theory is effectively applied to mathematical statistics. 

On the other hand, a quantization of random variables was done in quantum probability theory. 
However, its meaning is not clear from the viewpoint of simultaneous measurement of plural observ- 
ables because it concerns not probabilistic structure but algebraic structure. Therefore, its result could 
not be applied to mathematical inference for quantum systems directly. At least, the quantum central 
limit theorem had not been considered applicable to evaluation of estimation error directly, as the 
classical one had. As we will describe in §12, however, through the quantum central limit theorem, 
we can prove that a suitable estimator attains the asymptotically minimum error under the condition 
that quantum correlation is allowed in measuring apparatus. 

While Matsumoto [63] obtained a similar statement for a general model in an imperfect form, our 
derivation is more natural and more general owing to the quantum central limit theorem. In this 
discussion, the key is to construct an estimator in a somewhat technical form and to combine it with 
the quantum central limit theorem. 

Quantum information theory is known as a field to discuss the quantum coding theorem and so 



5 



on. In the latter 1960s, investigators in USSR began quantum information theory to find out how 
efficiently classical messages can be transmitted through a quantum communication channel (coding of 
a quantum communication channel). They include Holevo [46, 48], Levitin [58], and Stratonovich [84], 
who is one of the founders of stochastic differential equations. This problem consists of both optimiza- 
tions of the encoding process and the decoding process. The latter optimization can be regarded as a 
decision problem of unknown quantum states, and Yuen, Kennedy, and Lax [93] formulated theory of 
the optimum receiver for given signals. Thus, as is different from quantum probability theory, quan- 
tum information theory has developed since early time, related closely to mathematical statistics for 
quantum systems similar to quantum estimation. Holevo obtained an upper bound for the communi- 
cation rate of the quantum communication channel in his two 1970s' papers [46, 48]. After no less than 
twenty years, in the late 1990s, Holevo [51] and Schumacher and Westmoreland [81] independently 
proved that his upper bound can be attained. While this result requires the assumption that the com- 
munication channel be stationary memory less. Hayashi and Nagaoka [38] recently obtained a coding 
theorem for general quantum communication channels that are not necessarily stationary memoryless. 
In the decoding process to achieve the optimal rate, so-called the square-root measurement is used in 
these results. Hayashi and Nagaoka [38] introduced an important inequality for evaluation of the error 
probability of this decoding. Indeed, in quantum estimation, the square-root measurement is used for 
constructing an asymptotically optimal estimator, as we will mention in §12 of this article. Jointly 
with the quantum central limit theorem, this inequality plays an important part in the evaluation for 
the error of the proposed estimator. Note that Holevo [52] made detailed explanation on topics about 
the coding theorem of a quantum communication channel. 

In the first half of 1990s, many researchers appeared in North America and Europe from related 
fields in quantum information theory. Then, besides the coding theorem of a quantum communication 
channel mentioned above, the following problems began to be discussed: transmitting the quantum 
states themselves, compressing the quantum states, and so on. In particular, compressing the quantum 
state, which Schumacher began, is deeply related with quantum estimation, so we will explain it here. 
This protocol, which depends on the density operator of this ensemble, is said to be fixed-length 
because its rate of compression is fixed in advance. Jozsa and the Horodecki family [53] proposed 
a fixed-length compression protocol depending only on the compression rate, which works for any 
stationary memoryless ensemble with the smaller von Neumann entropy than the compression rate. 
This protocol is called the universal fixed-length compression protocol. 

Furthermore, Hayashi and Matsumoto [36] proposed a compression protocol that works for any 
stationary memoryless ensemble. In such a protocol, the rate of compression should not be fixed in 
advance, i.e., a variable- length protocol is needed. In this compression protocol, a compression rate 
needs to be decided based on the input quantum state but then measurement is inevitable. If we 
assume only a stationary memoryless condition, the state demolition is inevitable, so it is impossible 
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to decode a message without any error. Thus, in variable-length compression, we need to treat the 
tradeoff between state demolition and estimation of the compression rate. Asymptotically, however, we 
can estimate the compression rate as precisely as the optimal case with scarce state demolition. This 
protocol is called universal variable-length, and is constructed based on state estimation and Weyl's 
dual representation theory of permutation groups and general linear groups on a tensor-product space 
[90, 17]. As a commentary of quantum information theory including transmission of quantum states, 
see, for example, Hayashi [30]. 

Next, we go on to another problem in quantum statistics inference, statistical hypothesis testing 
on quantum states, in which we assume two hypotheses for the unknown quantum state, and treat the 
tradeoff between two kinds of error probabilities. Nagaoka focused on this problem when each hypoth- 
esis is of a single quantum state (i.e., a simple case), and proposed a quantum information-spectrum 
method [69] , which is the quantum version of information-spectrum methods [19] . Testing two simple 
hypotheses is formulated by the Ncymann-Pearson fundamental lemma in a very simple form with the 
non-asymptotic framework for classical cases. Holevo [45] made its quantum extension in 1970s. In 
the classical case, Stein's lemma is known as fundamental fact that describes the asymptotic behavior 
of the optimal Type II error probability with the constant constraint for the Type I error probability. 
Its quantum version was obtained by combining results of Hiai and Petz [44] and Ogawa and Na- 
gaoka [72]. Furthermore, Hayashi [27] characterized quantum measurement attaining the asymptotic 
optimal performance in the sense of Stein's lemma. This characterization was made with irreducible 
representation theory of general linear groups mentioned above. Interestingly, it depends only on one 
of two hypotheses. Ogawa and Hayashi [75] analyzed the cases with an exponential constraint on 
one error probability. Furthermore, Ogawa [73] and Hayashi and Nagaoka [38] found a close relation 
between the quantum version of Stein's lemma and the quantum channel coding theorem described 
before. Besides, using information-spectrum methods, Nagaoka and Hayashi [71] formulate an infinite 
sequence of tests of general simple hypotheses. As a by-product of this formulation, they obtained 
theory in some extent to a general ensemble of quantum states that does not satisfy the stationary 
memoryless condition. In addition, extending theory of quantum hypothesis testing, Nagaoka formu- 
lated estimation theory based on large deviation type evaluation [64, 67]. Hayashi showed that two 
types of quantum Fisher information gave the bounds of the estimation error in two subtle different 
formulations [28]. 

Finally, we will describe the relation of quantum estimation with information geometry, which 
Amari and Nagaoka [1, 2] formulated by regarding statistical estimation as geometry of a family of 
probability distributions. They introduced the idea of dual connection first, which plays an important 
role. Nagaoka [65, 68] started a quantum extension of information geometry, and Petz et al. [79] 
characterized the quantum version of Fisher information geometrically (for details, see Amari and 
Nagaoka [2]). Petz studied one type of quantum version of Fisher information, called Bogoliubov 
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Fisher information, and showed that it is the most natural among several quantum versions of Fisher 
information from the geometrical viewpoint. As Nagaoka [64, 67] and Hayashi [28] discussed in detail, 
however, it turned out that such a geometrically natural Fisher information does not give a bound 
meaningful for estimation theory. Thus, it seems failed to extend the geometrical characterization of 
statistical estimation by dual connection to the quantum case, while this scenario might be expected 
in a naive sense by several researchers. Some quantum expansions of information geometry have been 
discussed only from the geometrical viewpoint, hut it seems necessary to grasp them synthetically 
including the viewpoint of estimation in future. As a geometrical study including the viewpoint of 
estimation theory, interpreting Berry's phase [83] as "strength of noncommutativity," Matsumoto [59] 
pointed out the relation between the bound of estimation error and Berry's phase in a family of 
pure states. Fujiwara [3, 7] and Matsumoto [60, 61] developed this study to an internal relation 
between Berry's phase — and its extension, Uhlmann's parallelism [85, 86] — and Nagaoka's quantum 
information geometry. 

3 Quantum mechanics, especially states and measurement 

To discuss statistical inference of quantum systems, it is necessary at least to describe the probabilities 
of observed values in measurement. However, though there are many textbooks about quantum 
mechanics, few of them describe the probabilities of observed values simply and rigorously. Though it 
is somewhat intrusive, we will explain the least items needed for description of probabilities of observed 
values in measurement. A theoretical description of state reduction by measurement has been already 
formulated rigorously, but we will not refer it here. See, e.g., Ozawa [77] or Hayashi [30]. 

First, a physical system of interest corresponds to a Hilbert space (a finite- or infinite-dimensional 
vector space with a Hermitian inner product), which is called its representation space. The system 
state is described with an operator p called a density operator satisfying 

p = p\ Tvp = l, p>0. (1) 

Furthermore, measurement on the system is described with a set of operators M := {M^jt^gfj satisfying 

M^ = M*, M^>0, ^iV4, = i-, (2) 

wen 

where M satisfying the above is called a positive operator-valued measure (POVM). (Those who 
have read a standard textbook of quantum mechanics should note that this is not necessarily 
a projection.) Here, / is the unit operator. We allow uj to take continuous values and ^ signifies 
integration then. Unless it is confusing, we do not explicitly specify in which uj takes values. 

The density operator and the POVM defined above are mathematical representations of the state 
and the measurement of the system, respectively. Its concrete implication is given as follows: if we 
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perform the measurement corresponding to M = {M^}i^^q to the state corresponding to p, then the 
probabihty to get lo is 



In this formulation, reproducibihty of both the state and measurement is implicitly assumed (other- 
wise, it may be almost impossible to verify the formula (3) experimentally). We will call the system 
H, the state p, the measurement M for short. 

According to the convention of physics, when we regard each an element u G 7^ as an element of 
the dual space H* through the inner product, we will denote it by {u\. When we want to emphasize 
that u is not an element of H* but is an element of H, we may write \u). In other words, when we 
regard \u) = {uiU2 ■ ■ E H as a column vector, the transposed vector with its complex conjugate 
elements is denoted as (n| = {ulu2 ■ ■ ■)■ In the following, we denote the set of density matrices by 
S{H), and we will explain its structure. If the system state is pi with probability A and p2 with 
probability 1 — A, by performing certain measurement M = {M^} to the system, the probability to 
get an observed value cj is 



Here, even if we regard the state of this system as p' = Xpi + (1 — A)p2 and calculate the probability 
distribution of the data according to (3) , no problem occurs on integrity with all theoretically possible 
experiments. Therefore, the state of the system can be regarded as p'. This is called probabilistic 
mixture (incoherent superposition) and is distinct from so-called quantum superposition. Furthermore, 
when the system is the composite system of systems Hi and H2, then the representation space is given 
by the tensor-product space Hi'S>H2- 

For the discussion above, we may disregard the existence of the representation space and need 
only treat a *-algebra, which is a generalization of the linear space of operators as follows: A complex 
linear space V is called a *-algebra when it has the unit element I, a product satisfying the linearity 
and the associativity, and a *-operation satisfying the following condition: (ex)* = c*x* for any x eV 
and any complex number c. For such a *-algebra, using the set of positive elements {xx*|a; eV}, we 
can reconstruct the description of quantum systems mentioned above as follows: First, a measurement 
is given by a decomposition of the unit element / by positive elements, that is, a generalized POVM 
M := {M^}^. A state is given by an element p of the dual space V* of V satisfying p{I) = 1 and 
p{xx*) > 0, that is, a generalized state. Then, when we perform the measurement M to the system in 
the state p, the probability to get an observed value u is given by p{M^). Note that, if x* = a; holds, 

^ gx/2(gx/2^* g |^^*|^ g y| because (e^/^)* = e^/^. Of course, if F is a space of linear operators 
on 7^, then the POVMs (the states) defined by (2) ((!)) coincide with the generalized POVMs (the 
generalized states) given here, respectively. Thus, we can regard the *-algebra F as a quantum system. 



Vf{uj) :=TVpM,. 



(3) 



ATrpiM^ + (1 - A)Trp2M^ = Tr{(Api + (1 - A)p2)M^}. 



(4) 
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Furthermore, the composite system of *-algebras Vi and V2 can be described with a *-algebra on the 
tensor-product space Vi ® V2 in which the following product and the *-operation: 



{xi ^yi) ■ {x2^y2) ■= {xi ■ X2) {yi ■ y2), {xi ^ yi)* := xl<Si yl- (5) 

When V is the set of linear operators on the Hilbert space, this tensor-product of *-algebras Vi and 
V2 coincides with the set of linear operators on the composite system defined in the former. 

4 One-mode quantum Gaussian systems 

As a concrete example of quantum systems, we consider a Hilbert space -L^(M), which is often called 
the Fock space. By denoting by \k) the A;th Hermitian function, {|0), . . .} forms its orthonormal basis. 
This space describes the physical system of photons with a specific wavelength, and the state \k){k\ 
represents the state of k photons. 

In this system, operators Q and P are defined by 

(Q/)(x) := xf{x), (P/)(x) := -i^ix). (6) 

These operators play an important role. They are called the position operator and the momentum 
operator when the quantum system represents a one-dimensional motion of a particle. They satisfy 
the commutation relation 

[Q,P]=iI. (7) 

By defining a coherent vector \a)a ■= J2h=o^~^~^ ^^\^) ("^ ^ '^)' they have the following relation 
with the annihilation operator a := ^{Q + iP): 

a\a)a = a\a)a- (8) 

In addition, the density operator p^fl '■= \o)a a{ct\ is called a coherent state. Two POVMs 



N : A;^ |A;)(A;| (9) 



H : a \a)a a{a\ (10) 



are known as important measurements, and are called the number measurement and the heterodyne 
measurement, respectively. The former takes integer values, which are discrete; the latter takes 



complex values, which are continuous. We can show that H is a POVM by the equality 



/ |a)a o(a| dx = I. (11) 
Jc 

A coherent state is a relatively stable state and plays an important part in quantum optics. In 
particular, in this article, we will note a quantum Gaussian state that is a probabilistic mixture of 
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coherent states by Gaussian integral p(^^N ^ Jc^ ^ l*^)" "^^^l "^^^ quantum Gaussian state 
p^^jv can be described with Q and P as fohows: 

exp(log(^)(a -C)(a-C)) exp ^^log^^^J 2 J 

^^'^ = = ^+1 ' ^^^^ 

where C = Ca; + ^Cj/- Furthermore, it is known that the quantum Gaussian state p^^^r satisfies the 
equation 

IV p^,jv exp(i(xQ + yP)) = exp (^^( V2Ca:X + V2Cj,y) - ^ + ^ + y')) > Vx, y e M. (13) 

Conversely, it is also known that a density operator satisfying (13) is restricted to a quantum Gaussian 
state pc^^N- Therefore, we can also regard (13) as the definition of pc^^N- 

It is known that any operator of this system can be given as the limit of sums of algebraic terms 
of Q and P [50]. Therefore, we can treat the quantum system described by the Hilbert space L^(M) 
based only on the operators Q and P satisfying the commutation relation (7) without considering the 
structure of the original Hilbert space L^(M). 

5 Multi-mode quantum Gaussian systems 

The Fock space above describes a physical system corresponding with one wavelength. A quantum 
system of photons of plural I wavelengths is described by a Hilbert space L?{W^) = L^(M)®'". That is, 
this quantum system can be described by the space of operators generated by a sequence of self-adjoint 
operators Q^, . . . Q"*, P^, . . . , P"* satisfying 

[p^ p^] = [Q^ Q^] = 0, [Q^ p^j = iskS'^^n. (m) 

We will more generally consider a system defined by a sequence of self-adjoint operators satisfying 

=is*^'^/ (15) 

(1 ^ k,j < d). Of course, s = [s'^'^] is an antisymmetric matrix. As generalization of (13), for a vector 

6 = [O'^] and a symmetric matrix v = [v'^'^] satisfying v > and v + is > 0, we define a quantum 
Gaussian state pe^^, as the state satisfying 

Tr p0^^ exp j i ^ j = exp [ i ^ i]kO'' - ^ '"'''^'^kVj ) • (16) 
\ k J \ k j,k J 

It is known that the condition above is equivalent to the following [78] : 

Tt pe^^X'' = e\ (17) 

TrpBAix''-o'^){x'^-e'^)---{x'^-e'')) = !^^^^^^^^^^^^^^ ^I'r'' ^'^^ 
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In particular, for / = 2 in (18), the right-hand side is equal to v'^'^. Note that ^ signifies the sum 
with respect to all kinds of decomposition {Hi, ... , Hn} of {1, 2, ... , 2n} of the form = {kh,jh}j 
where jh < kh- 

The existence of X and pg^^ can be shown as follows: First, assume that d is even and the rank 
of s is d. Then, we select an adequate orthogonal matrix so that ava* is the unit matrix and asa* 
satisfies (14), where a := ov~^/'^. Therefore, we can construct the sequence X of operators (the state 
P0,v) cts operators (a state) on the representation space L^(M''/^), respectively. Next, we consider the 
case when the rank of s is less than d (for an odd d, this holds by antisymmetry) . If the rank of v is d, 
adding adequate operators . . . , x'^'^~'^^^^ ^ , we can reduce this case to the former one. If the rank 

of V is also less than d, we change the coordinate adequately so that v and s have non-zero components 
at only the first rankv components. Then, we can regard x^'^^^'"'^^, . . . ^X*^ as the constant and 
the rest elements X^ , . . . ^x^^'^^^'" can be reduced to the former argument. In the following, we will 
disregard the structure of Hilbert space and discuss only operators characterized by (15), (17), and 
(18). 

Here, we define the operator Tgiy for a vector X consisting of operators as follows: 

exp (-{X - e'nvT'^f{\{vT'^s{v')-'^\){v')-kx - e')) 

r0>'(X):= ^ , , , — ^, (19) 

(27r)2(det('u'))2 det(l - \{v')-2s{v')-2\4) 

where the function / on [0, 1) is 

Here, when the variable of / is a matrix, / is defined as a matrix function, i.e., the function / is applied 
to the diagonal elements. Since / is defined on [0, 1), we need to select v' so that all eigenvalues of 
\{v')~'2s{v')~2\ are less than 1. 

Then, for a multi-mode quantum Gaussian system, we can prove 

/ Te',^'{X)de' = 1, (21) 



exp 



TrTe>y{X)pe,^ = ^ , (22) 

{2tt)2 {det{v + v'))2 

Therefore, (19) and (21) show that Tg/y^X) gives a generalized POVM. Furthermore, for a sequence 
Xd' of the d operators X^,. . . ,X^', a d'-dimensional matrix v', and a d'-dimensional vector 6', we 
define TgrylXdi), similarly to the above, by 

exp (-(X,, - e'r{vT'^fi\ivT'^s{v')-'^\){v')-kxa' - e')) 

Tg>,v'{Xd') := J, 1 J — . (23) 

(27r)^(det(u'))2 det(l - \{v')'^ Sd'{v')~2\4) 

Then, similar equalities to (21) and (22) hold and Tffiy{Xd') is thought to give a generalized POVM. 
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6 The quantum central limit theorem 



For a density matrix p on we choose a self-adjoint operator X satisfying Ti pX = 0, and denote by 
the self-adjoint operator ^ YJj=i ^{j) on K®", where means I®---®I®X®I®---®I. 
Here, consider a vector consisting of self-adjoint operators X = {X^ , . . . satisfying Ti pX^ = 0. 
Then, matrices v'p'^X) := TipiX^oX^) and Sp'^'(X) := ^ Tr X^] satisfy ^p(X) +iSp(X) > 0. 
Concerning X^'^"^^ and a density matrix p®" := p (g) • • • (g) p on K®", the following theorem holds: 



Theorem 1 For X, we define a vector X consisting of operators on a multi-mode Gaussian system 
satisfying [X^ ,X''\ = isp''' . Then, the following equality holds: 

hm TVp®"P(x(")) = Trpo,.^P(X), (24) 

n—^oo 

where P is an arbitrary polynomial and po^vp is a quantum Gaussian state on X. 

This theorem is called the quantum central limit theorem (or the algebraic central limit theorem), 
which Giri and von Waldenfels [16] proved first, and Petz [78] serves as its introductory references. For 
an operator Y, we will denote by Y the corresponding operator on the multi-mode Gaussian system 
in the sense of the central limit theorem above. In this theorem, we regard a sequence of operators 
Xi, . . . ,Xd as a quantum version of sequence of random variables, like the classical case. Then, the 
expectation of their product converges to the expectation of the product for the Gaussian systems. 
However, it does not show that the operators Xi, . . . , X^i are asymptotically commutative in some 
sense, so it is difficult to find out what kind of measurement is available in the asymptotic setting only 
from this theorem. 

On the other hand, the equality (21) guarantees that the operator Tgi ,^i{X) can be regarded as a 
measurement in the multi-mode Gaussian system. Since the operator Tg'y{X^^^^) can be characterized 
as the limit of the polynomial of operators X^"'\ using Theorem 1 and adequate discussion of analytic 
prolongation, we can show the following theorem[33]: 

Theorem 2 If Ti pex.p{t{X^)'^) < oo for i = l,...,d < I and sufficiently small t > 0, then the 
following equalities hold: 

hm Trp^"T^, (X^")) = Tr po,^,T0, y (X d) , (25) 

n — ''■CO d' d ^ d^ d 

hm TVp«"X^-^(«)r«,,,,(x(")) = Tvpo,^,X'Te>,,(X,). (26) 

Tl — >00 d a 

Hence, we can expect to apply these discussions to estimation theory where measurement is concerned. 

7 Statistical inference in quantum systems 

When a quantum system is generated from a new particle generator or we lack a part of information 
describing the system, the density matrix of the system is not a priori known, so we infer the density 
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matrix from the data of an experiment designed adequately. In particular, estimating the unknown 
density matrix is called quantum estimation. When we statistically infer the density matrix of the 
system, we need to prepare plural systems of the same state. For example, when the particles are 
repeatedly generated by a certain particle generator under the same condition, the states of these 
systems may be regarded almost as the same states. 

Under such a framework, from the prior information, we often assume that the unknown density 
matrix belongs to a certain family of density operators (matrices) S = {po G S{T-l)\G € C W^}. 
Then, estimating the density matrices are equivalent with estimating the unknown parameter from 
observed data. 

For example, when Tt = C'^, Tinless we have prior knowledge, we need to estimate the parameter 
(x, y, z) in the family of density matrices 



1/ \ + X y + iz 



Px,y,z 2\y-iz 1-x 



l>x^ + y^ + z^>0 



For the same system, if it is known (or can be considered) that Trp ( ^. ^ 1=0, then we can 



-i 

assume that the unknown density matrix belongs to the family of density matrices 

Therefore, we need only estimate the unknown parameter (x, y) from the observed data. 

As another example, consider a system of photons with the frequency /. This system is considered 
to develop as time evolution according to the following Master equation [82] : 



dp 
'dt 



u [a''a+^] ,p 



— {aa*p — 2a* pa + paa*) ^ — ^ — ^-{aa*p — 2apa* + pa* a), 



where c is the coupling constant with the environment system and n is the average photon number 
in the environment system. Thus, the coherent state \Co)a a{Co\ evolves in time as \Co)a a{Co\ 
p ct , . Since the coherent state is natural for the initial state, it is natural to assume 
that the unknown final state belongs to the one-mode quantum Gaussian states family. If N is known, 
then the problem is estimation of the two-dimensional parameter (Ci,C2) in the family of density 
operators Sff := {p(,n\C{= Ci + C2i) G C}. If N is unknown, then the problem is estimation of the 
three-dimensional parameter (Ci, C2) in the family of density operators := {P(;^n\C & C,N > 0}. 

More generally, if the system is H, measurement is described by a POVM M on H. In particular, 
when we estimate an unknown state that belongs to a family of density operators S = {pg G S(7i)\6 G 
6 C W^}, we need a mapping from the set $7 of observed values to the parameter space G besides 
a POVM. The joint process of this measurement and this mapping can be described by a POVM 
that takes values in the parameter space 9, which is called an estimator. We often discuss statistical 
inference under the assumption that n unknown states identical to pg are prepared independently. 
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Then, the total system is written by the Hilbert space H®", and its density operator is Pg. Hence, 
we treat estimation concerning the family of density operators {pf" G S{J-i)\6 G 6 C M*^}, and any 
estimator is given by a POVM on "H®". 

In statistical inference of quantum systems, now, we assume that one measurement M is chosen 
and cannot be changed. Then, estimation of the density matrix can be reduced to estimation in a 
family of distributions {Ppg\0 G O}. To investigate discussion peculiar to quantum system, we have 
to discuss optimization of measurement M. This can be also considered a kind of experimental design. 

For example, when the unknown state is pe and an estimator M is applied, the kih. element e^(M) 
of the mean of the estimate eg{M) is equal to 

e^(M) = / e''TipgM{de) (27) 
Je 

and the mean-square error matrix vq{M) = [vq\M)\ becomes 

vl'\M) = j {6^- e^){e^ - e^) tt pqM{ m). (28) 

In statistical inference, we mainly evaluate errors with the mean-square error matrix. We often pose 
the condition of unbiasedness 

60 (M) = e (29) 

to estimators. This condition, however, depends on the choice of the coordinates. Therefore, we need 
not always restrict estimators within this condition for discussion. 

When all density operators in the given family are commutative, they can be diagonalized si- 
multaneously, and the optimal measurement is given as the resolution of the unity based on the 
orthonormal system consisting of common eigenvectors. Here, label the common eigenvectors of these 

density matrices by a; = 1,2, Then, diagonal elements pg{oj) form a probability distribution. We 

can reduce our problem to estimation of the unknown parameter in a family of probability distri- 
butions {pe = {pe{uj)}\0 G 6 C R'^}. When the system follows classical mechanics, all states are 
described by probability distributions, so the estimation problem concerning a family of probability 
distributions is said to be classical. Even in such classical cases, it is generally difficult to minimize 
the mean-square error matrix vq{M) for all 6. Therefore, for classical models, we often discuss only 
the cases of sufficiently large n (asymptotic cases). 

Here, we need to explain briefly why the composite system of two same quantum systems is 
represented not by the symmetric or antisymmetric tensor-product space, but by the tensor product 
system H®"^. Generally, even in the "same" experiments in classical setting, their places and their 
times are different. That is, even if the two quantum systems can be regarded as the same, usually 
only the microscopic parts are the same and represented by but other parts (e.g., the position and 
the time) are different. Hence, if the states of these different parts are known and if we focus only on 
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the same system, then the total focused system cannot be treated by Bose statistics or Fermi statistics 
and the representation space is the tensor product space H.®'^ . This is because these two particles can 
be distinguished by the additional parts (e.g., the position). For example, n spin 1/2 particles have 
the same system concerning the spin, but to treat their position, we need another representation 
space. If we know the position of each particle, and if we focus only on their spin, the system of the 
total spin is represented by the n-tensor product space of because we identify each particle. Here, 
it is assumed that plural identical systems are prepared, not that we make copies of the same state. 
It is impossible to clone a quantum state perfectly. 

As we wrote in §1, if the number of particles is quite huge, the average of this ensemble obeys 
classical mechanics. This number is quite different from the number required for the application of 
asymptotic theory of statistical inference. A number large for statistical inference is ten thousand or a 
hundred thousand. On the other hand, it is considered that approximation of classical mechanics can 
be applied when the number is about Avogadro's constant (ca. 6 x 10^^) or its square root. Therefore, 
even if asymptotic theory can be applied, we still need to use the quantum framework. 

8 Asymptotic theory in classical systems 

We will briefly explain asymptotic theory in classical systems. First, for a family of probability 
distributions {pe = {pg{uj)}\0 G 6 C R'^}, we define the Fisher information matrix jq = [jk,i;0] by 

E d log peioj) d log pe{uj) , . 

Po{^) dfk QQi • (30) 

Then, for an unbiased estimator M, the following Cramer-Rao inequality holds: 

ve{M)>j-\ (31) 

In the proof of the inequality (31), the Schwarz inequality plays an essential role. For details, see 
[1, 2]. As we mentioned before, however, we need not restrict estimators within unbiased ones but we 
need to discuss estimators in a wider class. When the number n of the prepared states is sufficiently 
large, it is relatively easy to treat the problem: If n is large, for an estimator M", the mean-square 
error matrix vg{M"') asymptotically satisfies the inequality 

veiM-) > -jg'. (32) 
n 

Furthermore, the maximum likelihood estimator 0ML,n(^i) ■ ■ ■ j^n) '■= argmaxgp^^ • • -pg", satisfies 
the equality (=) when n is sufficiently large. Therefore, for parametric estimation in a family of 
probability distributions, the inverse j^^ of the Fisher information matrix signifies the asymptotically 
optimal mean-square error. For its validity, some conditions on an estimator M" are needed for the 
inequality (32). This fact shows that the mean-square error matrix can be minimized asymptotically. 
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These discussions with sufficiently large n are called asymptotic theory, and its general and rigorous 
discussion has been established for classical cases. To discuss this problem more rigorously, however, 
we need to clarify the conditions for estimators [88]. 

Note that, in the quantum framework, when density operators are noncommutative, the asymptot- 
ically optimal mean square error matrix is not unique even in the asymptotic setting. This is because 
we have to consider a tradeoff between the mean-square errors of respective parameters. Thus, we take 
the strategy of minimizing the trace of the product of the mean-square error matrix and a weighted 
matrix. 

9 Estimation of the expectation parameter in the family of the 
multi-mode quantum Gaussian states 

Here, we discuss the estimation only of the expectation parameter in the family of the quantum 
Gaussian states {p0,v\& G M?^} defined in §5. Since the state of this system is transformed to p^e^^® 
P^^v adequate unitary, the structure of the state family concerning the unknown parameter 

6 is invariant even if the number n of states increases. Therefore, we can apply the discussion for 
n = 1 to the cases of n > 2. 

Here, we will restrict our estimators to unbiased ones. Later, we will go on to the discussion of an 
arbitrary family of quantum states, in which for sufficiently large n, we will treat estimators that may 
be biased. 

Like (31), the following right logarithmic derivative (RLD) Cramer-Rao inequality holds: 

ve{M)>3^},, (33) 
where the RLD Fisher matrix jg n and the RLD Li-g are defined as 



jk,l;0,R ■— "^T^ PeLl;0{-^k;eT , 




— PeLi;e- 



Like (31), the Schwarz inequality plays an essential role in its proof. By calculation, for the family of 
the Gaussian states, the RLD Fisher matrix jg n has the following inverse matrix: 

je}i = v + is. (34) 

Here, taking the trace, we get 

tnve{M) > minjtr a|a is symmetric, a > v + is, and a > 0} = tr(v + |s|), (35) 

so we have a lower bound. On the other hand, by (22), when the state is pe,v, if we perform the mea- 
surement corresponding to a POVM Tjg|^^(X) := {Tg \g\^^{X)}, then the observed value is distributed 
as the normal distribution with the mean and the variance v -|- |s| -|- e. Therefore, we get 

inf{trv0(M)|M is unbiased.} = tr(v -|- \s\) (36) 
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(see [50]). This formulation is generalized to the minimization of the weighted sum of the elements of 
V0{M), in which we minimize the error ti gv0{M) expressed with a weight matrix g > 0. Note that 
there are many quantum extensions of logarithmic derivatives owing to noncommutativity, but this 
definition is the most suitable when we discuss only the family of the quantum Gaussian states. In a 
later section, we will explain its relation with another logarithmic derivative. For derivation of (33) 
and (34), see Holevo [50] or Hayashi [30]. 

10 Asymptotic theory for an arbitrary family of quantum states 

In a similar way to classical systems, it is not easy to optimize our estimator with an arbitrary family 
of quantum states when the number of states n is finite. Thus, we will discuss the case when n is 
sufficiently large, that is, for the asymptotic case: We will treat a sequence of estimators M = {M"} 
and we impose the following asymptotic local covariance condition at each point 0, instead of the 
unbiasedness. Thus, we define the following to represent a distribution of difference between the 
true value and the estimate: 

^ xeB}. (37) 



P^" (5) := Tr p®"M" | ^ + 

Note that P^l{B) is different from P^"{B). If 
Pe 

lim P^"{B) = limP*r^(B), V0{M) = lim nt;0(M") (38) 

•yn n— »oo 

hold for each c G M*^, then M is said to be asymptotically locally covariant at 6, and we will write 
Pf{B) = lim P^"{B). Then, V0{M) is expressed by 



Furthermore, if an estimator is asymptotically locally covariant at all 0, it is simply said to be asymp- 
totically locally covariant. 

In the following, we will treat the value 

C0{g) := ird{ivv0{M)g\M is asymptotically locally covariant at 0}, 

expressed with a weight matrix gr > 0. In particular, it is simplified to Cq when g is the identity 
matrix. 

To perform the measurement corresponding to an arbitrary POVM on 7i®", we need quantum 
interaction among systems, which is usually difficult. As a class of measurements implemented easily, 
we consider the following class illustrated below: Each measurement is performed for each quantum 
system but the selection of measurements is possible adaptively according to the data obtained in all 
measurements before: Depending on the first to {k — l)th observed values (wi, . . . ,uok-i) € O*^"^, the 
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kth measurement M*^(a;i, . . . ,iOk-i) can be selected. This is the same setting as the classical adaptive 
experimental design, and a POVM corresponding to this measurement is said to be adaptive. 



Measurement apparatus Observed value 



Estimate 



Pei- 



P0\- 



Pe\ 





Ml 




> 






\ > 


M2 

















M" 


1 > 


>■ 



Figure 1 

An adaptive POVM can be written as a POVM on 7^®" as 



(39) 



Furthermore, as a wider class of POVMs, we sometimes consider a class whose element M{uj) can be 
expressed as a tensor product. A POVM in this class is said to be separable and can be written as 



{M^{uj)<^M^{uj) 



'M"(a;)}a,. 



(40) 



Like Cg{g), we note 



Cff'^ig) ■= inf{trt;0(M)g|M is adaptive and asymptotically locally covariant at 6}, 
Ce'^(fl') := infjtr ve(M)gf|M is separable and asymptotically locally covariant at 0}. 

By the inclusion relation between these classes, we see Cg'"'{g) > CQ'^{g), but they coincide as we 
will explain later. When g is the identity matrix, they are denoted as C^'" and Cg'^, respectively. 

Next, to discuss CQ{g) from different viewpoints, we consider local unbiasedness as a weaker con- 
dition than the unbiasedness condition: Approximating the unbiasedness condition in a neighborhood 
of 6q G 6, we get 



0=00 



51 



(41) 
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Thus, this is called the local unbiasedness condition at 0o ^ ©• 

Based on the local unbiasedness condition, we define the following value: 

Cg{g) := inf{trt;e(M)sr|M is locally unbiased at 6}. (42) 

In particular, if g is the identity matrix, it is denoted as Cg. 

Generally, in the commutative case, that is, when we consider a family of probability distributions, 
Cgig) = Trjg^g holds: There exists a locally unbiased estimator whose covariance matrix is j^g. 
Conversely, the covariance matrix of any locally unbiased estimator is lower bounded by jg^. Thus, 
for a POVM M, denoting the Fisher information matrix of a family of probability distributions by 
j^, we get 

C'eig) = mf{ti{j^')-'g\M: POVM}. (43) 

In addition, let Cg{g) denote the value Cg{g) of the family of density operators {pg^\0 G &}. 
Comparing their selections of measurements, we can easily show Cg{g) > nCg{g). The following 
theorem holds: 

Theorem 3 By letting vg{X) := Vpg{X) and se{X) := Spg{X), the following equalities hold: 

C^{g) = l^nC^ig) = C^{g) := mm |tr ive{X)g + \^S0{X)^\) TrX'^^ = 5f | , (44) 
C^'^'ig) = Ci^\9) = Cl{g). (45) 

Furthermore, the hounds CQ{g) and Cg'"'{g) can be attained at all points 6 by a single estimator in 
the respective classes. 

From this theorem, we can consider that Cglg) and Cg{g) represent the bounds of aeeiiraey of 
quantTim estimation. The former represents the bound with no quantum correlation in measuring 
apparatus, while the latter represents the bound when the quantum correlation is positively used in 
measuring apparatus. It is known that they coincide when d = 1 (Helstrom [42]) or when pg is the 
pure state (Matsumoto [63]). 

To analyze these values further, we introduce the symmetric logarithmic derivative (SLD) Lj;-^ 
and the SLD Fisher information matrix jg s as follows: 

:= Lk-e o pg, jk,l;9,S •■= Tr pe(-Ljfc;e o Li-g). (46) 

When d = 1, we can show Cq (g) = j^g = Cg^g). Here, we define a superoperator Dg as what 
satisfies TT{De{X) o Y)pe = -i Tt[X, Y]pg. If ah operators Dg{Ll), . . . , Dg{L'^) can be expressed as 
a linear combination of Lg, . . . , L^, then 

C^{g) = tr {v0{Lg')g + \^S0{Lg')^\) , (47) 
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where we define L^^''^ := Yli{j0^s)'''^^l;0- Then, 

ve{L^') + seiL,^) = j^^^ (48) 

holds. In particular, for the family of the quantum Gaussian states, this condition holds and we can 
check that TT{vg + \y^s^/g\) is equal to the bound Cg^g). Then, there is no difference between Cg{g) 
and Cg{g). Furthermore, if sg{Lg^) = holds, then we can show 

C^{g)=tvgjg^S- (49) 
On the other hand, Gill and Massar [15] proved the inequality 

trje;^j^<dimK-l. (50) 

_ 1 _ 1 

In particular, for dim 7^ = 2, the right-hand side is 1. When Jqs^^^os '^^ ^ one-dimensional projec- 
tion, the measurement M can be realized by the spectral decomposition of the Hermitian matrix that 

_ 1 _ 1 

is the eigenvector with the eigenvalue 1 of 3qs^^^0S- Hence, when jg^j^ = 1, this measurement 
can be constructed by a probabilistic combination of the measurements above. Thus, the set of all 
realizable measurements is {M\tT jg^jg^ = 1}. Using Lagrange multiplier method, we obtain 

Clig)=(t.^jgfgjgfy. (51) 

For two parameters, Nagaoka [66] proved (51), while for three parameters Hayashi [20, 21] proved it 
first using the duality theorem of linear programming. We can further prove the following lemma: 

Lemma 4 When dim 7^ = 2, for g > 0, 

Cl{g) = C^{g) (52) 
holds if and only if d = 1 holds or d = 2 and pg is a pure state. 

An outline of the proof of Theorem 3: We will prove the theorem by showing some inequalities. 
The most difficult part lira nCg{g) < Cg{g) will be proved in a later section because of not only its 
difficulty but also its connection with the quantum central limit theorem. 

First, we will prove Cg^'ig) < Cl{g). By selecting a POVM Mg satisfying tr^jg^y^g = C^(flf), 
the bound can be attained at 6. From this fact, we will show Cg'"'{g) < Cg{g) as follows: We 
measure the first ^/n states with a POVM M' satisfying jg^' > for all 6, and we estimate the 
unknown parameter by the maximum likelihood estimator based on ^/n data. We measure each 
of the remaining n — ^Jn systems by the measurement M^. Then, the bound Cg{g) can be attained 
uniformly for all [15, 34]. This shows that the bound Cg{g) can be uniformly attained by an adaptive 
POVM. Applying this argument to the system H®'^, we can prove Cg{g) < mC]^{g) and we can show 
that mCg^{g) can be attained for all 0. Then, letting m ^ oo, we get Cg{g) < lim^^oo 'mC]^{g). 
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From the above, we obtain (44) if nCg{g) > Cg{g) and Cg{g) > Cq {g) are proved. It requires 
several pages to prove these inequahties rigorously, so we will write only an outline. Letting := 
Jjjd 9^M"'{d6), wc get the former for n = 1. Then, the former inequality for arbitrary n follows from 
the fact that the vector X minimizing tr [v0{X)g + \y/gs0{X)^\^ in (44) belongs to the range of 
the superoperator Dg whose domain is the linear space spanned by Li, . . . , L^. The latter is proved 
as follows: First, the asymptotic local unbiasedness concerning M is defined by extending the local 
unbiasedness. Then, we prove that the asymptotic local unbiasedness at 6 implies the asymptotic 
local covariance at 9. An inequality similar to the Cramer-Rao inequality is derived by using the 
Schwarz inequality. Similarly, we can show C^^'^^g) > C0{g). Therefore, we obtain (45). 

We will explain the condition of local unbiasedness a little. In fact, it has a clear meaning to 
minimize the mean-square error under the local unbiasedness condition while this condition is not 
so natural. This is because this minimum mean-square error coincides with the minimum mean- 
square error under the more natural condition, i.e., the asymptotically locally covariance. This causes 
from the relation between the minimum mean-square error and the Fisher information matrix jgf for 
a fixed measurement M. However, if we adopt an error different from the mean-square error — for 
example, the absolute value of the difference from the true parameter or its s( < 2)th power as a loss 
function — then such meaning cannot hold. In particular, if ps is a pure state, then the infimum of 
such an error is under the local unbiasedness condition [21]. Therefore, there is no correspondence 
of this infimum with the error of the optimal covariant estimator given in Hayashi [22], which is the 
optimum concerning the family of all pure states in a finite-dimensional system under the covariant 
or the minimax criterion. 

11 Estimation with quantum correlation of the family of the one- 
mode quantum Gaussian states 

We will discuss the simultaneous estimation of ( and N in the family of the one-mode quantum 
Gaussian states {P(;,n\C € C, TV > 0} given in §4. In this model, the optimal measurement can be 
physically realized and a lower bound can be explicitly obtained. Hence, we can understand "How 
does the quantum correlation effect the quantum estimation?" When the size of noise N is known, as 
we discussed in §9, 

C^'" = Cf = 2{N + 1) 

holds, so there is no advantage to use quantum correlation. When we perform the heterodyne mea- 
surement (10), we can construct an estimator attaining this lower bound [50]. 
Now, consider estimating not only ( but also A^". Then, the relation 

> ^C,N = {N + 2){N + 1) = 2(A^ + 1) + N{N + 1) 
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holds for the family of density operators := {/9^,Ar|C € C, A'' > 0}, so we obtain an effect to use 
quantum correlation [24, 29]. 

Regrettably, the value of C^'^ has not been obtained concretely. Here, using a different method, 
we will evaluate the difference between presence and absence of quantum correlation. Restricting the 
sum of the mean-square errors for a complex parameter ^(= + ^2*) to be ^^^^p^, we will focus on 
the minimum mean-square error concerning the other parameter N. This minimum value (per one 
prepared state) is {N + 1)^ without quantum correlation, while it is asymptotically equal to A^(A^ + 1) 
with quantum correlation. In this setting, the heterodyne measurement (10) is the optimum for the 
former, while the measurement in the following procedure is the optimum for the latter: 

First, using an adequate interaction among n systems, we will transform the system as 

^C,-^V PV^CN Po,N ■ 

Then, we perform the heterodyne measurement (10) for the first system, and estimate the parameter ^ 
by the observed value. We measure the photon numbers (9) for the other n—1 systems, and estimate 
the other parameter N by its mean. For example, for n = 4, the proposed measurement can be 
realized by combining half mirrors as the following Figure 2: 
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where N and H respectively signify number measurement (9) and heterodyne measurement (10), 



while HM signifies a half mirror: precisely, HH is 180-degree rotation of a half mirror 



HM 



12 The quantum central limit theorem and the bound of the esti- 
mation error 

Using the quantum central limit theorem, we will construct a sequence of locally unbiased estimators 
that asymptotically attains CQ{g). First, we select a sequence of operators X = {X^, . . . ,X^) on H 
satisfying 

C^{9) = tr {vo{X)g + \^se{X)^\) , TrX'^^ = 6^ 
and define an operator Sn,Ry on H®" as 

Sn,R,v' := [ T^y{X(^^)dx. (53) 

Then, we define a POVM M" = {^"(^)}||0||<fl/^ by 

x,«(0) := V^S:;^l,T^^^^,ix(-^)S-^/l,. (54) 

Here, as we will discuss later, for v' satisfying v' + isg{X) > 0, there is a sequence {Rn{ > 0)} such 
that 

hm = S^, lim nv^'^iMl^, ^ ^ ) = v''\X) + v'"^'. (55) 

Now, we define := " agi and we introduce another outcome 9 := 9. The estimator 

ivr^i X R„ "^ith the outcome 9' is locally unbiased. Its covariance matrix is A~^V0{M^, xi?„)(^n^)* 
and it converges to V0{X) + v'. 

Now, we select v' by v' = y/g^^ {jy/gseiX)^\ + e) ^/g~^ for any e > 0. Then, by taking the 
limit e — > 0, it attains Cg{g). Here, we cannot substitute into e directly. This is because there is 
an eigenvalue 1 in the matrix \(v'y^ s{v'y^, which is the variable of / in the definition of T^g^, 
at (19). In fact, it diverges when the variable of / is 1. 

We will write an outline of the proof of (55). Essentially, (55) follows from Theorem 2, but 

— 1/2 

treatment of ^ ^, is somewhat difficult. We need to show that this operator converges to / in some 
sense asymptotically in a neighborhood of To show this, in this context, we can check that it is 
enough to prove that Trp|'"(/ — Sn,R,v')'^ converges to by technical treatments of inequalities. This 
convergence follows from Theorem 2 and its small extension. 

The point of this proof is to derive the convergence with respect to probability distributions from 
the central limit theorem guaranteeing the convergence with respect to algebraic structure (Theo- 
rem 1). Since it was easier to extend the central limit theorem to quantum systems based on the 
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algebraic structure, Theorem 1 was obtained first. However, the original central limit theorem guar- 
antees that the distribution converges to the normal one, so its quantum extension should guarantee 
the convergence with respect to distributions in some sense. The argument presented in this section 
says that simultaneous distributions by a quantum measurement converge to the simultaneous distri- 
bution by the corresponding quantum measurement under the family of the quantum Gaussian states. 
This can be called a quantum version of the central limit theorem satisfying the condition above. 
However, since the proof is complicated and is not straightforward enough, it is understood that there 
is a possibility for improvement. There might be a quantum central limit theorem in a more natural 
form. Attainment of the asymptotic bound described here might follow from such a theorem and we 
can expect development toward this direction. 

13 Future prospects 

In this article, to construct an estimator that works uniformly efficiently for an arbitrary family of 
quantum states, we featured a two-stage method of estimation: First, we estimate the parameter with 
-y/n states, and we make measurement working locally efficiently for the rest of the states. Since such an 
estimator is not so natural, we hope that a more natural estimator will be constructed. An estimator 
{M'^{6)}g that constructed below is expected to work uniformly efficiently, though its asymptotic 
performance has not be analyzed enough. For {fleje, we define 

•dpe 



Xg := argmin |tr {v0{X)gg + \y/g^S0{X)y/g^\ 

Sn,vg '■= / Txw{Xg ^)dd, 
Je 

M^e) := S-^'t^^^, {XP)S-X'. (56) 



Then, we conjecture that 

lim nveiM"") = vg{X) + vL (57) 

holds. If we exchange the orders of the limits without checking its validity, we can derive (57) similarly 
to the argument in §12. Of course, (57) is only a conjecture because this derivation contains many 
operations that are not mathematically allowed. Therefore, we can expect that the bound would be 
attained by the estimator (56) with v'q := ^JgeT^ i\\f9e^eiXQ)^fgQ\ + e) ^Jgo~^ in the limit e — > 0. 

Furthermore, in classical systems, it is known that geometrical quantities such as curvatures play 
important roles in the asymptotic theory concerning higher-order errors, and the optimum higher- 
order errors can be geometrically characterized [1, 2]. Similarly, we can expect that higher error terms 
would be related to geometrical quantities in some sense in the quantum case. Hayashi [26] treated 
higher-order asymptotic theory only in restricted models and pointed out that higher-order errors are 
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related to a kind of geometrical quantity in these models. We hope that these facts will be analyzed 
from a more general viewpoint. 

In future, to develop studies in this field, knowledge of classical mathematical statistics is of course 
needed, but also mathematical technique peculiar to quantum systems is often needed as we mentioned 
in §2. Therefore, we need to grasp statistical inference of quantum systems not only from the closed 
viewpoint of mathematical statistics, but also from wider viewpoints including general information 
processing of quantum systems. Studies considering application to actual experiments are also needed. 

Finally, the author thanks the referee for useful comments. 

Additional note 

After the publication of the original Japanese version, two books treating quantum estimation were 
published [31, 30]. The book [31] consists of the preceding papers after Holevo's famous book[50]. 
It contains many references of this paper. In particular, it includes English translations of several 
important manuscripts originally written in Japanese. The other book [30] is a textbook covering 
quantum estimation and quantum information. Also one article [37] concerning quantum estimation 
was written. It rigorously treats the asymptotic quantum estimation in a quantum two level system 
(i.e., qubit system) and its relation to the estimation of the one-mode quantum Gaussian case. 
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